Novel VTEO Based Mel Cepstral Features for Classification of Normal and Pathological Voices

Authors

Hemant A. Patil

Pallavi N. Baljekar

Abstract

In this paper, novel Variable length Teager Energy Operator (VTEO) based Mel cepstral features, viz., VTMFCC, are proposed for automatic classification of normal and pathological voices. Experiments were carried out using the proposed feature set, MFCC, and their score-level fusion. Classification was performed using a second-order polynomial classifier on a subset of the MEEI database. With fusion, the equal error rate (EER) was 3.2% lower than that of MFCC alone, which served as the baseline. The effectiveness of the proposed feature set was also investigated under degraded conditions, using babble and high-frequency channel noise from the NOISEX-92 database.
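The abstract does not spell out the operator itself. As a rough illustration only, the sketch below assumes the form in which the variable length Teager energy operator is commonly written, x(n)^2 - x(n-i)x(n+i), where i is a dependency index and i = 1 gives the classical Teager energy operator. The function name `vteo` and the parameter `di` are hypothetical and not taken from the paper, and the remaining VTMFCC steps (Mel filterbank, logarithm, DCT) are not shown.

```python
import numpy as np

def vteo(x, di=1):
    """Sketch of a variable length Teager energy operator (assumed form).

    Computes x[n]**2 - x[n - di] * x[n + di] for every n that has
    di samples available on both sides, so the output is 2*di samples
    shorter than the input. di=1 is the classical Teager operator.
    """
    x = np.asarray(x, dtype=float)
    if di < 1 or x.size <= 2 * di:
        raise ValueError("di must be >= 1 and the signal longer than 2*di")
    centre = x[di:-di]        # x[n]
    behind = x[:-2 * di]      # x[n - di]
    ahead = x[2 * di:]        # x[n + di]
    return centre ** 2 - behind * ahead

# Illustrative input: a pure tone A*cos(w*n), chosen only for the demo.
n = np.arange(200)
energy = vteo(2.0 * np.cos(0.3 * n), di=3)
```

For a pure tone A cos(wn), this expression reduces to the constant A^2 sin^2(i w), which gives a simple sanity check for the implementation.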


Similar resources

Artificial Neural Network Based Pathological Voice Classification Using Mfcc Features

The analysis of pathological voice is a challenging and an important area of research in speech processing. Acoustic voice analysis can be used to characterize the pathological voices with the aid of the speech signals recorded from the patients. This paper presents a method for the identification and classification of pathological voice using Artificial Neural Network. Multilayer Perceptron Ne...


On combining information from modulation spectra and mel-frequency cepstral coefficients for automatic detection of pathological voices.

This work presents a novel approach for the automatic detection of pathological voices based on fusing the information extracted by means of mel-frequency cepstral coefficients (MFCC) and features derived from the modulation spectra (MS). The system proposed uses a two-stepped classification scheme. First, the MFCC and MS features were used to feed two different and independent classifiers; and...


A two-stage approach using Gaussian mixture models and higher-order statistics for a classification of normal and pathological voices

A two-stage classifier is used to improve the classification performance between normal and pathological voices. A primary classification between normal and pathological voices is achieved by the Gaussian mixture model (GMM) log-likelihood scores. For samples that do not meet the thresholds for normal or disordered voice in the GMM, the final decision is made by a higher-order statistics (HOS)-...


Automatic classification of normal and abnormal cardiac sounds by combining features based on wavelet transform and capstral coefficients extracted from PCG signals (Research Article)

Cardiac sounds are produced by the mechanical activities of the heart and provide useful information about the function of the heart valves. Due to the transient and unstable nature of the heart's sound and the limitation of the human hearing system, it is difficult to categorize heart sound signals based on what is heard from a stethoscope. Therefore, providing an automated algorithm for prima...


Voice-based Age and Gender Recognition using Training Generative Sparse Model

Abstract: Gender recognition and age detection are important problems in telephone speech processing to investigate the identity of an individual using voice characteristics. In this paper a new gender and age recognition system is introduced based on generative incoherent models learned using sparse non-negative matrix factorization and atom correction post-processing method. Similar to genera...




Publication date: 2011
